Opponent Modelling and Strategy Evolution in the Iterated Prisoner’s Dilemma

نویسنده

  • Daniel W. Dyer
چکیده

Learning and evolution are two adaptive processes in the natural world that have been modelled in the study of artificial intelligence in computer science. In both biology and in artificial intelligence, learning and evolution are complementary processes. The nature of the interactions between learning and evolution has been the subject of much research in scientific disciplines. Evolution of a species can promote learning in individuals and learning in individuals can guide evolution of the species. In artificial intelligence, both learning and evolution can be applied to achieve similar goals. But which is more effective for a particular problem? What if, rather than being complementary, learning and evolution were in competition? Which is more important, being smart or having good breeding? In this paper the game of the Iterated Prisoner’s Dilemma is used as a context for investigating the answers to these questions. Evolutionary techniques have been widely applied to find good strategies for the game. The evolutionary approach to finding strategies is continued here and is measured against a learning approach. Can an adaptive learning agent out-perform evolved strategies? How does evolution respond to the presence of an adaptive agent? The experiments show that opponent modelling can be very effective against evolved populations of deterministic strategies. When stochastic strategies are considered, evolution produces strategies that are able to exploit the learning agent. However, the learning agent is able to recognise that being exploited is better than retaliation in this zero-sum game and is able to achieve near-optimal performance against these non-deterministic players.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Opponent Modelling for the IPD - OPEN

This paper describes the design of Laran, an intelligent player for the iterated prisoner’s dilemma. Laran is based on an evolutionary algorithm, but instead of using evolution as a mean to define a suitable strategy, it uses evolution to model the behavior of its adversary. In some sense, it understands its opponent, and then exploits such knowledge to devise the best possible conduct. The int...

متن کامل

Kybernet Ik a — Ma Nuscr Ipt Collective Behavior and Kin Selection in Evo- Lutionary Iterated Prisoner’s Dilemma

Some strategies can be evolutionarily stronger than others although no evolutionarily stable strategy exists in iterated prisoner’s dilemma if the long-term payoff for each player is not insignificant. Li and Kendall (2009) introduce a so-called collective strategy for evolutionary iterated prisoner’s dilemma which plays a sequence of predefined moves and then identifies the opponent according ...

متن کامل

Martin de Laat

In this study an attempt is made to create a Virtual Agent (VA) designed to exercise control in the opponent’s behavior by interaction with its human opponent in order to influence the outcome of a game. The goal is to create an artificial player succeeding to influence its human opponent in a game with no optimal strategy, one which is based around the (iterated) prisoner’s dilemma.

متن کامل

Towards Social Complexity Reduction in Multiagent Learning: The ADHOC Approach

This paper presents a novel method for classifying adversaries that is designed to achieve social complexity reduction in large-scale, open multiagent systems. In contrast to previous work on opponent modelling, we seek to generalise from individuals and to identify suitable opponent classes. To validate the adequacy of our approach, we present initial experiments in a multiagent Iterated Priso...

متن کامل

Effects of increasing the number of players and memory size in the iterated

The Prisoner’s Dilemma has become a paradigm for the evolution of altruistic behaviour. Here we present results of numerical simulations of the infinitely iterated stochastic simultaneous Prisoner’s Dilemma considering players with longer memory, encounters of more than two players as well as different payoff values. This provides us with a better foundation to compare theoretical results to ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004